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INTESTINAL TREFOIL PROTEINS 

Background 

The field of the invention is peptides that are 
5 useful for the diagnosis, prevention, or treatment of 
wounds, including those that are associated with a 
gastrointestinal disorder. 

Jorgensen et al. (Regulatory Peptides 2*231, 1982) 
describe a porcine pancreatic peptide, pancreatic 
10 spasmolytic peptide (PSP) . PSP was found to inhibit 

"gastrointestinal motility and gastric acid secretion in 
laboratory animal after parenteral as well as oral 
administration." It was suggested that "if the results 
in animal experiments can be confirmed in man, PSP may 
15 possess a potential utility in treatment of 
gastroduodenal ulcer diseases." 

Summary of the Invention 
In a first aspect, the invention features a 
purified nucleic acid encoding an intestinal trefoil 
20 factor (ITF) . 

In preferred embodiments, the intestinal trefoil 
factor is mammalian intestinal trefoil factor, preferably 
human, rat, bovine, or porcine intestinal trefoil factor. 
In another preferred embodiment, the purified nucleic 
25 acid encoding an intestinal trefoil factor is present 
within a vector. 

In a related aspect, the invention features a cell 
that includes a vector encoding an intestinal trefoil 
factor. 

30 In another related aspect, the invention features 

a substantially pure intestinal trefoil factor. In a 
preferred embodiment, the polypeptide is detectably 
labelled. In a related aspect, the invention features a 
therapeutic composition that includes an intestinal 
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trefoil factor and a pharmacologically acceptable 
carrier. 

In another aspect, the invention features a 
monoclonal antibody which preferentially binds (i.e., 
5 forms an immune complex with) an intestinal trefoil 
factor. In a preferred embodiment, the monoclonal 
antibody is detectably labelled. 

In a related aspect, the invention features a 
method for detecting human intestinal trefoil factor in a 

10 human patient. The method includes the steps of 

contacting a biological sample obtained from the patient 
with a monoclonal antibody which preferentially binds 
intestinal trefoil factor, and detecting immune complexes 
formed with the monoclonal antibody. In preferred 

15 embodiments the biological sample is an intestinal 
mucosal scraping, or serum. 

In a related aspect, the invention features a 
method for treating digestive disorders in a human 
patient, which method involves administering to the 

20 patient a therapeutic composition that includes an 
intestinal trefoil factor and a pharmacologically 
acceptable carrier. Additional disorders that can also 
be treated are described below. 

In another aspect, the invention features a method 

25 for detecting binding sites for intestinal trefoil factor 
in a patient. The method involves contacting a 
biological sample obtained from the patient with the 
factor, and detecting the factor bound to the biological 
sample as an indication of the presence of the binding 

30 sites in the sample. By "binding sites," as used herein, 
is meant any antibody or receptor that binds to an 
intestinal trefoil factor protein, factor, or analog. 
The detection or quantitation of binding sites may be a 
useful reflection of abnormalities of the 

35 gastrointestinal tract. 
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In another aspect, the invention features 
substantially pure trefoil factor. In preferred 
embodiments, the intestinal trefoil factor is human, 
porcine, or bovine trefoil factor. 
5 By "intestinal trefoil factor" ("ITF") is meant 

any protein that is substantially homologous to rat 
intestinal trefoil factor (Fig. 2; SEQ ID NO: 2) and which 
is expressed in the large intestine, small intestine, or 
colon to a greater extent than it is expressed in tissues 

10 other than the small intestine, large intestine, or 
colon. Also included are: allelic variations; natural 
mutants; induced mutants; proteins encoded by DNA that 
hybridizes under high or low stringency conditions to ITF 
encoding nucleic acids retrieved from naturally occurring 

is material; and polypeptides or proteins retrieved by 
antisera to ITF, especially by antisera to the active 
site or binding domain of ITF. The term also includes 
other chimeric polypeptides that include an ITF. 

The term ITF also includes analogs of naturally 

20 occurring ITF polypeptides. Analogs can differ from 
naturally occurring ITF by amino acid sequence 
differences or by modifications that do not affect 
sequence, or by both. Analogs of the invention will 
generally exhibit at least 70%, more preferably 80%, more 

25 preferably 90%, and most preferably 95% or even 99%, 
homology with all or part of a naturally occurring ITF 
sequence. The length of comparison sequences will 
generally be at least 8 amino acid residues, usually at 
least 20 amino acid residues, more usually at least 24 

30 amino acid residues, typically at least 28 amino acid 
residues, and preferably more than 3 5 amino acid 
residues. Modifications include in vivo, or in vitro 
chemical derivatization of polypeptides, e.g., 
acetylation, or carboxylation. Also included are 

35 modifications of glycosylation, e.g., those made by 
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modifying the glycosylation patterns of a polypeptide 
during its synthesis and processing or in further 
processing steps, e.g., by exposing the polypeptide to 
enzymes that affect glycosylation derived from cells that 
5 normally provide such processing, e.g., mammalian 

glycosylation enzymes. Also embraced are versions of the 
same primary amino acid sequence that have phosphorylated 
amino acid residues, e.g., phosphotyrosine, 
phosphoserine, or phosphothreonine . Analogs can differ 

10 from naturally occurring ITF by alterations of their 
primary sequence. These include genetic variants, both 
natural and induced. Induced mutants may be derived by 
various techniques, including random mutagenesis of the 
encoding nucleic acids using irradiation or exposure to 

is ethanemethylsulfate (EMS) , or may incorporate changes 

produced by site-specific mutagenesis or other techniques 
of molecular biology. See, Sambrook, Fritsch and 
Maniatis (1989), Molecular Cloning: A Laboratory Manual 
(2d ed.), CSH Press, Cold Spring Harbor, New York. Also 

20 included are analogs that include residues other than 

naturally occurring L-amino acids, e.g., D-amino acids or 
non-naturally occurring or synthetic amino acids, e.g., 0 
or y amino acids. 

In addition to substantially full-length 

25 polypeptides, the term ITF, as used herein, includes 
biologically active fragments of the polypeptides. As 
used herein, the term "fragment," as applied to a 
polypeptide, will ordinarily be at least 10 contiguous 
amino acids, typically at least 20 contiguous amino 

30 acids, more typically at least 3 0 contiguous amino acids, 
usually at least 40 contiguous amino acids, preferably at 
least 50 contiguous amino acids, and most preferably at 
least 60 to 80 or more contiguous amino acids in length. 
Fragments of ITF can be generated by methods known to 

35 those skilled in the art. The ability of a candidate 
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fragment to exhibit a biological activity of ITF can be 
assessed by methods known to those skilled in the art. 

Also included in the term "fragment" are 
biologically active ITF polypeptides containing amino 
5 acids that are normally removed during protein 

processing, including additional amino acids that are not 
required for the biological activity of the polypeptide, 
or including additional amino acids that result from 
alternative mRNA splicing or alternative protein 

10 processing events. 

An ITF polypeptide, fragment, or analog is 
biologically active if it exhibits a biological activity 
of a naturally occurring ITF, e.g., the ability to alter 
gastrointestinal motility in a mammal. 

is The invention also includes nucleic acid 

sequences, and purified preparations thereof, that encode 
the ITF polypeptides described herein, as well as 
antibodies, preferably monoclonal antibodies, that bind 
specifically to ITF polypeptides. 

20 As used herein, the term "substantially pure" 

describes a compound, e.g., a nucleic acid, a protein, or 
a polypeptide, e.g., an ITF protein or polypeptide, that 
is substantially free from the components that naturally 
accompany it. Typically, a compound is substantially 

25 pure when at least 60%, more preferably at least 75%, 
more preferably at least 90%, and most preferably at 
least 99%, of the total material (by volume, by wet or 
dry weight, or by mole per cent or mole fraction) in a 
sample is the compound of interest. Purity can be 

30 measured by any appropriate method, e.g., in the case of 

< 

polypeptides, by column chromatography, polyacrylamide 
gel electrophoresis, or HPLC analysis. 

By "isolated DNA" is meant DNA that is free of the 
genes which, in the naturally-occurring genome of the 
35 organism from which the given DNA of the invention is 
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derived, flank the DNA. The term "isolated DNA" thus 
encompasses, for example, cDNA, cloned genomic DNA, and 
synthetic DNA. A "purified nucleic acid," as used 
herein, refers to a nucleic acid sequence that is 
5 substantially free of other macromolecules (e.g., other 
nucleic acids and proteins) with which it naturally 
occurs within a cell. In preferred embodiments, less 
than 40% (and more preferably less than 25%) of the 
purified nucleic acid preparation consists of such other 

10 macromolecule. 

"Homologous," as used herein, refers to the 
subunit sequence similarity between two polymeric 
molecules, e.g., between two nucleic acid molecules, 
e.g., two DNA molecules, or two polypeptide molecules. 

is When a subunit position in both of the molecules is 
occupied by the same monomeric subunit, e.g., if a 
position in each of two DNA molecules is occupied by 
adenine, then they are homologous at that position. The 
homology between two sequences is a direct function of 

20 the number of matching or homologous positions, e.g., if 
half, e.g., 5 of 10, of the positions in two compound 
sequences are homologous then the two sequences are 50% 
homologous; if 90% of the positions, e.g., 9 of 10, are 
matched or homologous the two sequences share 90% 

25 homology. By way of example, the DNA sequences 

5'-ATTGCC-3' and 5'-TATGGC-3' share 50% homology. By 
"substantially homologous" is meant largely but not 
wholly homologous. 

The ITF proteins of the invention are resistant to 

30 destruction in the digestive tract, and can be used for 
treatment of peptic ulcer diseases, inflammatory bowel 
diseases, and for protection of the intestinal tract from 
injury caused by insults such as radiation injury or 
bacterial infection. An ITF protein, fragment, or analog 

35 can also be used to treat neoplastic cancer and, as 
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described further below, to protect (i.e., by inhibiting 
the formation of lesions) or to treat any part of the 
body from inflammation or injuries such as lesions, 
ulcerations, burns, or abrasions. 

5 In general, trefoil proteins, including ITF, are 

useful for the treatment of disorders of and damage to 
the alimentary canal, including the mouth, esophagus, 
stomach, and large and small intestine, as well as for 
the protection and treatment of tissues that lie outside 

10 the alimentary canal. The polypeptide can be used either 
to treat lesions in these areas or to inhibit the 
formation of lesions. The latter tissues include, for 
example, the external surface of the skin, the surface of 
the eye, the mucosa of the nasal passages and respiratory 

15 tract, and the genitourinary tract. 

One of the most common bacterial infections is 
caused by Helicobacter pylori (H. pylori), which leads to 
active, chronic gastritis and frequently to associated 
syndromes such as duodenal ulcer, gastric ulcer, gastric 

20 cancer, MALT lymphoma, or Menetrier's syndrome. 

Eradication of H . pylori has been shown to reduce the 
recurrence of duodenal and gastric ulcers. Furthermore, 
it has been postulated that widespread treatment of H. 
pylori will reduce the incidence of gastric carcinoma, 

25 which is the second leading cause of cancer related death 
world-wide. 

Long-standing gastritis associated with H. pylori 
infection is often associated with the expression of 
intestinal-like features in the gastric mucosa. This 

30 condition, referred to as intestinal metaplasia (IM) , may 
signal an increased risk of gastric cancer. The etiology 
of IM is unclear; it could represent a mutational 
adaptation or defense against H. pylori infection. For 
example, the metaplastic mucosa may produce mucus or 

35 other substances that create an environment that is 
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hostile to H. pylori. ITF can be used in the treatment 
of H. pylori infection and conditions associated with H. 
pylori infection (e.g., ulcers, gastric carcinoma, non- 
ulcer dyspepsia, gastritis, and esophageal lesions 
5 associated with gastro-esophageal reflux disease) . ITF 
is useful for treatment of these conditions because of 
its generally protective effect on the gastrointestinal 
tract. In addition, ITF promotes the maintenance of 
mucosal integrity. ITF can be used to inhibit adhesion 

io to or colonization of the mucosa by H. pylori. In this 
application, ITF or fragments or variants thereof which 
inhibit adhesion or colonization of the mucosa by H. 
pylori are useful. Such molecules can be identified 
using assays known to those skilled in the art, including 

15 the H. pylori binding assay described below. 

ITF may also be used promote healing of tissues 
damaged by conditions associated with H . pylori 
infection. In this regard, it is important that addition 
of trefoil proteins to wounded monolayers of confluent 

20 intestinal epithelial cells increases the rate of 

epithelial cell migration into the wound. This effect is 
enhanced by concomitant addition of mucin glycoproteins, 
the other dominant product of goblet cells. 

Just as ITF can be used to protect other parts of 

25 the gastro-intestinal tract or alimentary canal, such as 
the intestine, it can be used to protect the mouth and 
esophagus from damage caused by radiation therapy "or 
chemotherapy. ITF can also be used to protect against 
(i.e., inhibit the injury caused by) and/or to treat 

30 damage caused by alcohols or drugs generally. Additional 
tissues that can be protected or treated by an ITF 
include those listed above, which lie outside the 
alimentary canal. 

Members of the trefoil family, including ITF, can 

35 be used in the treatments discussed above. Skilled 
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artisans may review these proteins in Sands et al. (1996, 
Ann. Rev. Physiol. 58:253-273). As stated above, the 
invention encompasses biologically active fragments of 
the trefoil proteins. Fragments that retain the trefoil 
5 structure (i.e., the three loop structure) or that lie 
within regions of the protein that are highly conserved 
may prove particularly useful. Thus, such fragments can 
encompass portions of ITF from about the first cysteine 
residue involved in a disulfide bond of the three loop 

10 structure to about the last cysteine residue involved in 
a disulfide bond of the three loop structure. 

Variants of a selected trefoil protein are least 
60%, preferably at least 75%, more preferably at least 
90%, and most preferably at least 95% identical to the 

15 selected trefoil protein, preferably a human trefoil 
protein, more preferably human ITF. 

The term "identical," as used herein in reference 
to polypeptide or DNA sequences, refers to the subunit 
sequence identity between two molecules. In the case of 

20 amino acid sequences that are less than 100% identical to 
a reference sequence, the non-identical positions are 
preferably, but not necessarily, conservative 
substitutions for the reference sequence. Conservative 
substitutions typically include substitutions within the 

25 following groups: glycine, alanine; valine, isoleucine, 
leucine; aspartic acid, glutamic acid; asparagine, 
glutamine; serine, threonine; lysine, arginine; and 
phenyalanine, tyrosine. Sequence identity is typically 
measured using sequence analysis software such as the 

30 Sequence Analysis Software Package of the Genetics 
Computer Group at the University of Wisconsin 
(Biotechnology Center, 1710 University Avenue, Madison, 
WI 53705), and the default parameters specified therein. 

A variant of a selected trefoil protein preferably 

35 has the amino acids present in the naturally-occurring 
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form of the selected trefoil protein at the more highly 
conserved amino acid positions of the protein. Thus, a 
variant of human ITF preferably is identical to 
naturally-occurring human ITF at all or nearly all of the 
5 more highly conserved positions. Sequence conservation 
among trefoil proteins is evident in Table 1 of Sands et 
al. (supra) which can be used by those skilled in the art 
to identify more conserved residues. 

The invention features a method for treating or 

10 inhibiting the formation of lesions in the alimentary 
canal of a patient by administering to the patient at 
least one trefoil polypeptide, or a biologically active 
fragment thereof. The lesions typically occur in the 
mucosa of the alimentary canal, and may be present in the 

15 mouth, esophagus, stomach, or intestine of the patient. 
The lesions can be caused in several ways. For example, 
the patient may be receiving radiation therapy or 
chemotherapy for the treatment of cancer. These 
treatments typically cause lesions in the mouth and 

20 esophagus of the patient. Skilled artisans will 
recognize that it may be useful to administer the 
proteins of the invention to the patient before such 
treatment is begun. Alternatively, the lesions can be 
caused by: (1) any other drug, including alcohol, that 

25 damages the alimentary canal, (2) accidental exposure to 
radiation or to a caustic substance, (3) an infection, or 
(4) a digestive disorder including but not limited to 
non-ulcer dyspepsia, gastritis, peptic or duodenal, ulcer, 
gastric cancer, MALT lymphoma, Menetrier's syndrome, 

30 gastro-esophageal reflux disease, and Crohn's disease. 

Tissues that lie outside the alimentary canal can 
also be treated by administering to the patient at least 
one trefoil peptide, or a biologically active fragment 
thereof, in the event those tissues are damaged by 

35 inflammation, a lesion, ulcer, abrasion, burn, or other 
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wound, or are at risk of being so injured (i.e., the 
method can be carried out prophalactically) . 

The peptide that is administered may be any 
peptide in the trefoil family, such as intestinal trefoil 
5 peptide (ITF) , spasmolytic peptide (SP) , and pS2 . For 
the treatment of human patients it is expected that the 
peptide will be expressed by a human gene. However, 
eucaryotic trefoil peptides, such as those cloned from 
the rat and mouse genomes may also prove effective. 

10 These peptides may be isolated from a naturally occurring 
source or synthesized by recombinant techniques. It is 
expected that the typical route of administration will be 
oral. Determining other routes of administration, and 
the effective dosage are well within the skills of 

15 ordinary artisans and will depend on many factors known 
to these artisans. The trefoil proteins may be 
administered singly, in combination with one another, 
and/or in combination with mucin glycoprotein 
preparations. "Treatment of lesions" encompasses both 

20 the inhibition of the formation of lesion and the healing 
of lesions already formed. Biologically active fragments 
and variants of a trefoil protein, particularly ITF, 
which promote healing of lesions or inhibit the formation 
of lesions, are useful in the treatments of the 

25 invention. 

The polypeptides of the invention can also be used 
for diagnostic purposes. For example, the polypeptides 
can be used in an assay to quantitate intestinal trefoil 
factor and related polypeptides, such as fragments and 

30 analogs, in tissues, serum, and other biological samples. 
In many inflammatory bowel diseases, and potentially many 
other inflammatory conditions, the expression of ITF is 
reduced. Thus, areas where expression is relatively low 
indicate the presence of injured tissue. 
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Alternatively, the polypeptides can be bound to a 
diagnostic marker and administered to a patient. In this 
circumstance, the polypeptide would facilitate the 
distribution of the diagnostic marker within any tissue 
5 that expresses a receptor for the polypeptide. The 

diagnostic marker can be any substance that is capable of 
being detected. Those of skill in the art are well aware 
of numerous imaging agents that can be used according to 
the present invention. 
10 Other features and advantages of the invention 

will be apparent from the following description of the 
preferred embodiments thereof, and from the claims. 

Brief Description of the Drawings 
Figure 1 is a depiction of the nucleotide sequence 
15 of rat trefoil factor (SEQ ID NO:l). 

Figure 2 is a depiction of the deduced amino acid 
sequence of rat trefoil factor (SEQ ID NO:2). 

Figure 3 is a depiction of the amino acid 
sequences of rat trefoil factor, pS2 protein, and 
20 pancreatic spasmolytic polypeptide (SP) . The sequences 
are aligned to illustrate the amino acid sequence 
homology between the proteins. Dashes (-■) indicate the 
insertion of spaces which optimize alignment. Bars 
indicate sequence identity. 
25 Figure 4 depicts the disulfide bond structure 

proposed for pS2 (SEQ ID NO: 15; panel A) and PSP (SEQ ID 
NO: 16; panel B) . 

Figure 5 is a depiction of the proposed disulfide 
bond structure of rat intestinal trefoil factor (SEQ ID 
30 NO: 17). 

Figure 6 is a depiction of the nucleotide sequence 
of the human intestinal trefoil factor cDNA and the 
corresponding deduced amino acid sequence (SEQ ID NO: 3). 
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Figure 7 is a diagram depicting the strategy used 
to mutate the ITF gene in embryonic stem cells. 

Figure 8 is a graph depicting survival following 
administration of Dextran Sulfate Sodium (DSS; 2.5% w/v 
5 in drinking water for 9 consecutive days) , shown as 

Kaplan-Meier transform of probability versus days of DSS 
treatment. 

Detailed Description 
Purification and cloning of rITF 

io An inhibitor of soft agar colony formation by 

human breast carcinoma-derived BT-20 cells (ATTC HTB79) 
was isolated from cytology-positive human malignant 
effusions (Podolsky et al., Cancer Res. 4ji:418, 1988). 
The factor also inhibited soft agar colony formation by 

15 human colon carcinoma-derived HCT15 cells (ATTC-CCL225) . 
Inhibition was not observed for polyoma and murine 
sarcoma virus transformed rodent fibroblast lines. The 
isolated factor (transformed cell-growth inhibiting 
factor or TGIF) had an apparent molecular weight of 

20 110,000 Da and appeared to consist of two 55,000 Da 
subunits linked by sulfhydryl bonds. 

The purified protein was partially sequenced. The 
sequence from the amino terminal 14 amino acids was used 
to produce a set of degenerate oligonucleotide probes for 

25 screening of a rat intestinal epithelial cell cDNA 
library. 

A rat intestinal cDNA library (Lambda ZAP* II, 
Stratagene, La Jolla, CA) was produced by standard 
techniques (Ausubel et al., Eds., In Current Protocols in 
30 Molecular Biology, John Wiley & Sons, New York, 1989) 
using cells purified by the method of Weisner (J. Biol. 
Chem. 248:2536, 1973). Screening of the cDNA library 
with the fully degenerate oligonucleotide probe described 
above resulted in the selection of 21 clones. One of the 
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clones (T3411) included a core sequence which encoded a 
single open reading frame. The nucleotide sequence of 
the open reading frame and flanking DNA is presented in 
Fig. 1 (SEQ ID N0:1). The insert present in T3411 was 
5 nick translated (Ausubel et al., supra) to produce a 
radioactively labelled probe for Northern blot analysis 
of rat poly(A) + RNA. Northern analysis demonstrated that 
RNA corresponding to the cloned cDNA fragment was 
expressed in small intestine, large intestine, and 

10 kidney; no expression was detected in the lung, spleen, 
heart, testes, muscle, stomach, pancreas, or liver. In 
the tissues in which the RNA was expressed, the level was 
comparable to that of act in. 

The open reading frame of clone T3411 encoded an 

15 81 amino acid peptide (Fig. 2; SEQ ID NO: 2). Comparison 
of the sequence of the encoded peptide, referred to as 
rat intestinal trefoil factor (rITF) , to the sequence of 
proteins in the Genebank database revealed significant 
homology to human breast cancer associated peptide (pS2; 

20 Jakowlev et al,, Nucleic Acids Res. 12:2861, 1984) and 
porcine pancreatic spasmolytic peptide (PSP; Thim et al., 
Biochem. Biophys . Acta. 827 :410, 1985). Figure 3 
illustrates the homology between rITF, PSP, and pS2. 
Porcine pancreatic spasmolytic factor (PSP) and pS2 are 

25 both thought to fold into a characteristic structure 
referred to as a trefoil. A trefoil structure consists 
of three loops formed by three disulfide bonds. pS2 is 
thought to include one trefoil (Fig. 4A) , and PSP is 
thought to include two trefoils (Fig. 4B) . The region of 

30 rITF (nucleotide 114 to nucleotide 230 which encodes cys 
tophe), which is most similar to PSP and pS2 , includes 
six cysteines, all of which are in the same position as 
the cysteines which make up the trefoil in pS2 (Fig. 3). 
Five of these six cysteines are in the same position as 

35 the cysteines which form the amino terminal trefoil of 
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PSP (Fig. 3) . Figure 5 depicts the proposed disulfide 
bond configuration of rITF. 

Based on homology to PSP and pS2 (Mori et al., 
Biochem. Biophys . Res. Comm. 155 :366. 1988; Jakowlew et 
5 al., Nucleic Acids Res . 12:2861, 1984), rITF includes a 
presumptive pro-sequence (metl to ala22) in which 12 of 
22 amino acids have hydrophobic side chains. 

Production of Anti-rlTF Antibodies 

A peptide corresponding to the carboxy-terminal 

10 21 amino acids of rITF was synthesized and coupled to 
bovine serum albumin (BSA) . This conjugate (and the 
unconjugated peptide) was used to raise polyclonal 
antibodies in rabbits. All procedures were standard 
protocols such as those described in Ausubel et al. 

15 (supra) . The anti-rlTF antibodies were used in an 

indirect immunof louresce assay for visualization of rITF 
in rat tissues. Cryosections of rat tissues were 
prepared using standard techniques, and fluorescein 
labelled goat anti-rabbit monoclonal antibody (labelled 

20 antibodies are available from such suppliers Kirkegaard 
and Perry Laboratories, Gaithersberg, MD; and Bioproducts 
for Science, Inc., Indianapolis, IN) was used to detect 
binding of rabbit anti-rlTF antibodies. By this analysis 
rITF appears to be present in the globlet cells of the 

25 small intestine but not in the stomach or the pancreas. 

Cloning of Human Intestinal Trefoil Factor 
DNA encoding the rat intestinal trefoil factor can 
be used to identify a cDNA clone encoding the human 
intestinal trefoil factor (hITF) . This can be 
30 accomplished by screening a human colon cDNA library with 
a probe derived from rITF or with a probe derived from 
part of the hITF gene. The latter probe can be obtained 
from a human colon or intestinal cDNA using the 
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polymerase chain reaction to isolate a part of the hITF 
gene. This probe can then serve as a specific probe for 
the identification of clones encoding all of the hITF 
gene, 

5 Construction of a cDNA Library 

A human colon or intestinal cDNA library in AgtlO 
or Agtll, or some other suitable vector is useful for 
isolation of hITF. Such libraries may be purchased 
(Clontech Laboratories, Palo Alto, CA: HLI034a, 

10 HLI0346b) . Alternatively, a library can be produced using 
mucosal scrapings from human colon or intestine. 
Briefly, total RNA is isolated from the tissue 
essentially as described by Chirgwin et al. (Biochemistry 
18:5294, 1979; see also Ausubel et al., supra). An oligo 

15 (dT) column is then used to isolate poly (A) + RNA by the 
method of Aviv et al. (J. Mol . Biol. .134:743, 1972; see 
also Ausubel et al., supra). Double-stranded cDNA is then 
produced by reverse transcription using oligo (dT) 12 _ 18 or 
random hexamer primers (or both) . RNAseH and E. coli DNA 

20 poll are then used to replace the RNA strand with a 
second DNA strand. In a subsequent step, E . coli DNA 
ligase and T4 DNA polymerase are used to close gaps in 
the second DNA strand and create blunt ends. Generally, 
the cDNA created is next methylated with EcoRI methylase 

25 and EcoRI linkers are added (other linkers can be used 
depending on the vector to be used) . In subsequent steps 
the excess linkers are removed by restriction digestion 
and the cDNA fragments are inserted into the desired 
vector. See Ausubel et al., supra and Sambrook et al. (In 

30 Molecular Cloning: A Laboratory Manual, CSH Laboratory 
Press, Cold Spring Harbor, NY, 1990) for detailed 
protocols. 
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Useful vectors include: Agtll, XgtlO, Lambda ZAP® 
II vector, Lambda Uni-ZAP™ XR vector, all available from 
Stratagene (La Jolla, CA) . 

The cDNA library must be packaged into phage; this 
5 is most readily accomplished by use of a commercial in 
vitro packaging kit, e.g., Gigapack® II Gold or Gigapack® 
II Plus (Stratagene, La Jolla, CA) . See Ausubel et al. 
{supra) for packaging protocols and suitable host 
strains. The library is preferably amplified soon after 
10 packaging; this step generates sufficient clones for 
multiple screening of the library. See Ausubel et al. 
supra or Sambrook et al. supra for details of 
amplification protocols and procedures for storing the 
amplified library. 

15 Screening of the cDNA Library 

To screen the library it must be placed on an 
appropriate host strain (e.g., Y1090 or Y1088 for XgtlO 
libraries, C600hf 1A for XgtlO libraries) . After plating 
the phage, plaques are transferred to nitrocellulose or 

20 nylon filters (See Ausubel et al., supra and Sambrook et 
al. supra) . The filters are then probed with cr 32 P-labelled 
nick translated probe derived from rITF. The probe is 
preferentially generated using a portion of the region of 
rITF DNA coding for the trefoil structure (nucleotides 

25 114 to 230 of SEQ ID N0:1, which encode cys32 to phe71 of 
SEQ ID NO: 2). This region is conserved between rITF, pS2 
and PSP, and it is likely that this region is conserved 
between rITF and hITF. Once a plaque is identified, 
several cycles of plaque purification are required to 

30 isolate a pure clone encoding hITF. A phage DNA 
isolation is performed and the cDNA insert can be 
subcloned into an appropriate vector for restriction 
mapping and sequencing. If the phage vector is Lambda 
ZAP® II, coinfection with helper phage allows rescue and 
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recircularization of pBluescript SIC phagemid vector 
(Stratagene, La Jolla, CA) harboring the cDNA; 
alternatively the phage clone is purified and the cDNA 
insert is subcloned into a vector suitable for 
5 restriction mapping and sequencing. If the clone does 
not contain the entire hITF gene (as assessed by homology 
to rITF and the presence of start and stop codons) , the 
library can be rescreened with the original rITF probe 
or, preferably, with a probe generated from the hITF 
10 clone obtained. If none of the clones contain the intact 
gene, it can be reconstructed from clones which bear 
overlapping fragments of hITF. 

Direct Isola tion of an hITF Probe bv PCR 

It is possible to isolate part of the hITF gene 

15 directly from the packaged library or cDNA. To isolate a 
portion of hITF directly from the packaged library, a 
pair of oligonucleotide primers and Taq polymerase are 
used to amplify the DNA corresponding to the hITF gene. 
The primers used would be approximately 15-20 nucleotides 

20 long and correspond in sequence to the 5 '-most and 3'- 
most portions of the rITF coding sequence. Friedman et 
al. (In PCR Protocols: A Guide to Methods and 
Applications, Innis et al., Eds., Academic Press, San 
Diego, CA) describe a procedure for such amplification. 

25 Briefly, phage particles are disrupted by heating; Taq 
polymerase, primers (300 pmol of each), dNTPs , and Taq 
polymerase buffer are added; and the mixture is thermally 
cycled to amplify DNA. The amplified DNA is isolated by 
agarose gel electrophoresis. The ends of the fragment 

30 are prepared for ligation into an appropriate vector by 
making them flush with T4 polymerase and, if desired, 
adding linkers. Alternatively, a restriction site may be 
engineered into the fragment by using primers which have 
sequence added to their 5' ends which sequence will 
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generate an appropriate sticky end when digested. For 
example the sequence: 5'-GGGCGGCCGC-3 ' (SEQ ID N0:4) can 
be added to the 5' end of each primer. This sequence 
includes the NotI restriction site flanked at the 5' end 
5 by the sequence: GG. The additional nucleotides prevent 
the 5' ends from denaturing and interfering with 
subsequent restriction digestion with NotI . The gel 
purified DNA of the appropriate size is next cloned into 
a cloning vector for sequencing and restriction mapping. 

10 This clone will not have the entire hITF sequence, rather 
it will be a combination of hITF (the region between the 
sequences corresponding to the primers) and rITF (the 5' 
and 3' ends which correspond to the primer sequences). 
However , this DNA can be used to generate a labelled 

15 probe (produced by nick translation or random primer 

labelling) which, since it is the correct hITF sequence , 
can be used in a high stringency screening of the library 
from which the cDNA was originally isolated. In an 
alternative approach, cDNA can be used in the above 

20 procedure instead of a packaged library. This eliminates 
the steps of modifying the cDNA for insertion into a 
vector as well as cDNA packaging and library 
amplification. Ausubel et al. supra provides a protocol 
for amplification of a particular DNA fragment directly 

25 from cDNA and a protocol for amplification from poly (A) + 
RNA. 

Identification of a Presumptive Human ITF clone 
A nick translated probe derived from rITF cDNA 
(corresponding to nucleotides 1 to 431 of SEQ ID NO:l) 
30 was used for Northern blot analysis of poly (A) + RNA 

derived from human intestinal mucosal scrapings. Probe 
hybridization and blot washing were carried out according 
to standard procedures. Probe (5 x 10 5 cpm/ml 
hybridization buffer) was hybridized to the filter at 
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45°C in 5X SSC with 30% forraamide. The filter was then 
washed at 60°C in 5X SSC with 40% formamide. Using this 
protocol, a band was clearly visible after an overnight 
exposure of the filter with an intensifying screen. This 
5 result indicated that there is sufficient homology 

between rITF and hITF to allow the use of probes derived 
from the sequence of the rITF gene for identification of 
the hITF gene. 

A human intestinal cDNA library was obtained from 

10 Clontech (Palo Alto, CA) . Alternatively, a human 
intestinal cDNA library may be produced from mucosal 
scrapings as described above. Four oligonucleotide 
probes were selected for screening the library cDNA. Two 
of the probes correspond to sequences within the region 

15 of rITF encoding the trefoil and are referred to as 
internal probes ( 5 ' -GTACATTCTGTCTCTTGCAGA-3 ' (SEQ ID 
NO : 5 ) and 5 ' -TAACCCTGCTGCTGCTGGTCCTGG-3 ' ( SEQ ID NO: 6) . 
The other two probes recognize sequences within rITF but 
outside of the trefoil encoding region and are referred 

20 to as external probes ( 5' -GTTTGCGTGCTGCCATGGAGA-3 ' (SEQ 
ID N0:7) and 5 ' -CCGCAATTAGAACAGCCTTGT-3 ' (SEQ ID N0:8). 
These probes were tested for their utility by using them 
to screen the rat intestinal cDNA library described 
above. Each of the four probes could be used to identify 

25 a clone harboring all or part of the rITF gene. This 
result indicates that these probes may be used to screen 
the human intestinal library for the presence of hITF. 

The internal probes were used as described above 
to amplify a DNA fragment from human colon library cDNA 

30 (Clontech, Palo Alto, CA) . Linkers were added to the 
isolated DNA fragment which was then inserted into 
pBluescript phagemid vector (Stratagene, La Jolla, CA) . 
The region of this clone corresponding to the sequence of 
human cDNA (i.e., not including the sequence 

35 corresponding to the internal probes) was used to make a 
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radioactively labelled probe by random oligonucleotide- 
primed synthesis (Ausbel et al., supra). This probe was 
then used to screen the human colon cDNA library. This 
screening led to the identification of 29 clones. One of 
5 these clones (HuPCR-ITF) was nick-translated to generate 
a probe for Northern analysis of poly (A) + RNA isolated 
from human intestinal mucosal scrapings. A single band 
of roughly the same size as the rat transcript 
(approximately 0.4 5 kDa) was observed. 

io Northern analysis of poly (A) + isolated from human 

tissues indicated that RNA corresponding to this probe 
was expressed in the small intestine and the large 
intestine but not in the stomach or the liver. These 
results indicate that the clone does not encode the human 

is homolog of porcine PSP. Porcine PSP is expressed in 
porcine pancreas and is not significantly expressed in 
the small or large intestine. These results also 
distinguish the cloned gene from pS2 which is expressed 
in the stomach. 

20 Figure 6 shows the nucleic acid sequence 

information for human ITF cDNA, along with the deduced 
amino acid sequence in one-letter code (SEQ ID N0:3). 
This clone was obtained by the methods described above. 

Production of hITF 

25 The isolated hITF gene can be cloned into a 

mammalian expression vector for protein expression. 
Appropriate vectors include pMAMneo (Clontech, Palo Alto, 
CA) which provides a RSV-LTR enhancer linked to a 
dexamethasone-inducible MMTV-LTR promoter, an SV40 origin 

30 of replication (allows replication in COS cells), a 
neomycin gene, and SV40 splicing and polyadenylation 
sites. This vector can be used to express the protein in 
COS cells, CHO cells, or mouse fibroblasts . The gene may 
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also be cloned into a vector for expression in drosophila 
cells using the bacoluvirus expression system. 

Purification of Intestinal Trefoil Factor 
Intestinal trefoil factor can be purified from 
5 intestinal mucosal scrapings of human, rats or any other 
species which expresses ITF (pigs and cows may provide a 
source of ITF) . The purification procedure used for PSP 
will be useful for the purification of ITF since the 
proteins are likely to be homologous, Jorgensen et al. 
10 describes a method for purification of PSP (Regulatory 
Peptides 3:207, 1982). The preferred method is the 
second approach described by Jorgensen et al. (supra). 
This method involves chromatography of SP-Sephadex C-25 
and QAE Sephadex A-25 columns (Sigma, St. Louis, MO) in 
15 acidic buffer. 

Anti-Intestinal Trefoil Factor Monoclonal 
Antibodies 

Anti-intestinal trefoil factor monoclonal 
antibodies can be raised against synthetic peptides whose 

20 sequences are based on the deduced amino acid sequence of 
cloned hITF (SEQ ID NO: 3). Most commonly the peptide is 
based on the amino-or carboxy-terminal 10-20 amino acids 
of the protein of interest (here, hITF) . The peptide is 
usually chemically cross- linked to a carrier molecule 

25 such as bovine serum albumin or keyhole limpet 

hemocyanin. The peptide is selected with the goal of 
generating antibodies which will cross-react with the 
native hITF. Accordingly, the peptide should correspond 
to an antigenic region of the peptide of interest. This 

30 is accomplished by choosing a region of the protein which 
is (1) surface exposed, e.g., a hydrophobic region or (2) 
relatively flexible, e.g., a loop region or a 0-turn 
region. In any case, if the peptide is to be coupled to 
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a carrier.it must have an amino acid with a side chain 
capable of participating in the coupling reaction. See 
Hopp et al. (Mol. Immunol. 20:483, 1983; J. Mol . Biol. 
157 : 105 . 1982) for a discussion of the issues involved in 
5 the selection of antigenic peptides. A second 

consideration is the presence of a protein homologous to 
hITF in the animal to be immunized. If such a protein 
exists, it is important to select a region of hITF which 
is not highly homologous to that homolog. 

10 For hITF, peptides that correspond to the amino- 

terminal or carboxy-terminal 15 amino acids are likely to 
be less homologous across species and exposed to the 
surface (and thus antigenic) . Thus they are preferred 
for the production of monoclonal antibodies. Purified 

15 hITF can also be used for the generation of antibodies. 

Genetic Disruption of a Trefoil Protein Impairs 

the 

Defense of Intestinal Mucosa 

As stated above, ITF is a member of the family of 
20 trefoil proteins that are expressed specifically and 

abundantly at the mucosal surface of the gastrointestinal 
tract. Other members of this family include pS2, which 
is expressed almost exclusively by foveolar cells of the 
stomach (Masiakowski et al., Nucl. Acids. Res. 10:7896, 
25 1982; Jorgensen et al., Regulatory Peptides 3.:231, 1982), 
and pancreatic spasmolytic peptide (SP) , which is 
expressed by the pancreas and by gastric antrum 
(Jorgensen et al., supra). As described above, the 
expression of these proteins is enhanced in the proximity 
30 of the injured bowel. In order to study the role of 

ITF in vivo, the gene was rendered non-functional by 
targeted disruption in mice. 
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Isolation of the Murine ITF Gene and Generatinn 
of ITF-D eficient Mice 

The murine ITF gene was isolated from a phage 
genomic library using the rat ITF cDNA sequence as a 
5 probe, and its identity was confirmed by nucleotide 
sequencing using standard techniques (Mashimo et al., 
Biochem. Biophys . Res. Comm. 210:31, 1995). 

A targeting vector for disrupting the gene by 
homologous recombination in embryonic stem (ES) cells was 

10 designed and constructed, as shown in Figure 7. The 

entire second exon (Ex2) of the murine ITF gene, which is 
contained within the Xbal-EcoRI fragment shown, was 
replaced with the neomycin resistance (neo) gene 
cassette. As the deleted sequence encodes most of the 

15 "trefoil domain," the ability of any resultant peptides 
to produce the looping structure characteristic of 
trefoil proteins is abolished. A positive-negative 
selection strategy (Mansour et al., Nature 336 ; 348 , 1988) 
was used to enrich for homologous recombination events in 

20 the embryonic stem (ES) cells by selecting for neo within 
the homologous DNA and against a herpes simplex virus 
thymidine kinase gene (hsv-tk) placed at the 3' end of 
the targeting vector. The pPNT plasmid (Tybulewicz et 
al., Cell 65:1153, 1991) was used to construct the 

25 targeting vector. The targeting vector was linearized 
with the restriction enzyme NotI and electroporated into 
pluripotent Jl ES cells (Li et al., Cell 69:915, 1992) 
under conditions previously described (Strittmatter et 
al., Cell 80:445, 1995). Disruption of the ITF gene in 

30 ES cells following homologous recombination was 

distinguished from random integration of the targeting 
vector by Southern blot analysis of genomic DNA from 
individual clones of cells digested with the restriction 
enzyme Xhol . The pITF2 probe identified a 19 kb "wild 

35 type" fragment and a 23 kb "knock out" fragment created 
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by introduction of an Xhol site upon homologous insertion 
of the targeting vector. Approximately 10% of neomycin- 
resistant ES clones were found to have undergone 
homologous ITF recombination using this method. 
5 The polymerase chain reaction (PCR) was used to 

confirm the targeted mutation as follows. A 200 bp 
region of DNA was amplified using primers spanning exon 2 
of ITF ( 5 ' -GCAGTGTAACAACCGTGGTTGCTGC-3 ' ( SEQ ID NO: 9) and 
5 ' -TGACCCTGTGTCATCACCCTGGC-3 9 (SEQ ID NO: 10)); and a 4 00 

io bp region of the neo gene was amplified with a second set 
of primers ( 5 ' -CGGCTGCTCTGATGGCCGCC-3 ' ( SEQ ID NO: 11) and 
5 ' -GCCGGCCACAGTCGATGAATC-3 ' ( SEQ ID NO: 12)) The DNA 
template for the PCR reaction was obtained from tail 
tissue. Approximately 0.5 cm of the tail was cut off each 

15 animal, and the samples were digested with proteinase-K 
(200 Ml at 0.5 mg/ml in 50 mM Tris-HCl pH 8.0 and 0.5% 
Triton X-100; Sigma, St. Louis, MO) at 55°C overnight. 
One Ml of this mixture was added directly to a 25 fil PCR 
reaction (per Stratagene, Menosha, WI). The reaction was 

20 begun with a "hot start" (incubation at 96°C for 10 

minutes), and the following cycle was repeated 30 times: 
72 °C for 120 seconds (hybridization and elongation) and 
96 °C for 30 seconds (denaturation) . Ten /xl of each 
reaction mixture was electrophoresed on a 2% agarose gel. 

25 Wild type animals were identified by the presence of a 
200 bp fragment, corresponding to an intact ITF gene, 
heterozygous animals were identified by the presence of 
this band and, in addition, a 400 bp fragment produced by 
amplification of the neo gene, and ITF-def icient (knock 

30 out) animals were identified by the presence of only the 
fragment corresponding to the neo gene. 

Two ES clones, which arose independently, were 
used to derive two lines of mice lacking ITF. These mice 
were screened by Southern genomic blot analysis as 

35 described for ES clones, or by PCR. 
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Analysis of Tr efoil Peptide Expression in 
Wild Type and Mut ant Mice 

Although expression of ITF is abolished in the 
mutant mice, expression of other trefoil genes is 
5 preserved. Northern blot analysis was performed using 
cDNA probes for ITF (Suemori et al., Proc. Natl. Acad. 
Sci. USA 8£:H017 # 1991), SP (Jeffrey et al. 
Gastroenterology 106:336, 1994), and, as a positive 
control, glyceraldehyde 3^-phosphate dehydrogenase 

10 (GAPDH) . The nucleic acid probe for murine pS2 was made 
by reverse transcription-polymerase chain reaction (RT- 
PCR) using the oligonucleotide pairs: 
5'-GAGAGGTTGCTGTTTTGATGACA-3' (SEQ ID NO: 13) and 
5 ' -GCCAAGTCTTGATGTAGCCAGTT-3 " (SEQ ID NO: 14), which were 

15 synthesized based on the published mouse pS2 cDNA 

sequence (GenBank Accession Number: Z218 58) . The GeneAmp 
RNA PCR Kit (Perkin Elmer) was used according to the 
manufacturer's instructions, as was the pCR^II 
(Invitrogen) cloning vector, RNA was extracted from the 

20 following tissues from both wild type and ITF-def icient 
(knock out) mice: stomach, duodenum, terminal ileum, 
right colon, appendix, transverse colon, left colon, and 
rectum. Fifteen fig of total RNA from each sample were 
electrophoresed on a 1% agarose gel, and transferred to 

25 nitrocellulose paper. Following hybridization, washing, 
and autoradiography, wild type mice exhibited a pattern 
of tissue expression considered normal: ITF was 
expressed in the small intestine and colon, which is the 
same expression pattern seen for ITF in the rat and 

30 human. The analysis of mutant mice confirmed the lack of 
ITF expression in the gastrointestinal tract. In 
contrast, the expression of the other trefoil proteins, 
SP and pS2, are unaltered in the gastrointestinal tract 
of mutant mice. SP was expressed in the stomach and, at 

35 lower levels, in the duodenum of both wild type and 
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mutant mice. Similarly, pS2 was expressed in the stomach 
of both wild type and ITF-def icient mice. 

Immunocytochemistry Reveals that ITF is not 
Expressed in the Colon of ITF-def icient mice 
5 In order to confirm that ITF protein was not 

expressed by ITF knock out mice, immunocytochemistry was 
performed as follows. Tissue from the colon and small 
intestine was fixed in the course of perfusion, immersed 
in 4% paraformaldehyde (McLean et al., J. Histochem. 

10 Cytochem. 22:1077, 1974), and embedded in paraffin. 
Sections were collected and stained either with a 
polyclonal antibody raised against a synthetic peptide 
from the predicted 18 car boxy-terminal amino acids of 
murine ITF or a monoclonal antibody against colonic mucin 

15 (Podolsky et al., J. Clin. Invest. 77:1263, 1986). 
Primary antibody binding was visualized with a 
biotinylated secondary antibody, Avidin DH, biotinylated 
horseradish peroxidase H, and diaminobenzidine 
tetrahydrochloride reagents according to the 

20 manufacturer's instructions (VectaStain ABC, Vector 
Laboratories, Bulingame, CA) . Following 
immunocytochemistry, the sections were counter stained 
with hematoxylin and viewed. Goblet cells in the colon 
of wild type mice were immunoreactive with both 

25 antibodies, staining positively for ITF and mucin. In 
contrast, the goblet cells in the colon of ITF-def icient 
mice lacked detectable ITF but continued to express 
colonic mucin. 

Induction of Mild Colonic Epithelial Injury 
30 with Dextran Sulfate Sodium 

ITF-def icient mice derived from each ES clone 
appear to develop normally and are grossly 
indistinguishable from heterozygous and wild type litter 
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mates. Their growth is not retarded and they reach 
maturity without evident diarrhea or occult fecal blood 
loss. However, the colon of ITF-def icient mice may be 
more prone to injury than the colon of wild type mice. 
5 To investigate this hypothesis, dextran sulfate sodium 
(DSS) , which reproducibly creates mild colonic epithelial 
injury with ulceration in mice (Kim et al., Scand. J. 
Gastroent. 27:529, 1992; Wells et al., J. Acquired Immune 
Deficiency Syndrome 3:361, 1990; Okayasu et al., 

10 Gastroenterology 98:694, 1990) was administered in the 
animals' drinking water. After standardization of DSS 
effects in comparable wild type mice, a group of 20 wild 
type and 20 ITF-def icient mice (litter mates from 
heterozygous crosses, weighing > 20 grams each) were 

15 treated with 2.5% DSS in their drinking water for nine 
days. 

Although 85% of wild type mice and 100% of ITF- 
def icient mice treated with DSS demonstrate occult blood 
(using Hemoccult, Smith Kline Diagnostics, San Jose, CA) 

20 in their stool during the period of treatment, ITF- 
def icient mice were markedly more sensitive to the 
injurious effects of DSS. Fifty percent of ITF-def icient 
mice developed frankly bloody diarrhea and died (Fig. 8). 
In contrast, only 10% of wild type mice treated similarly 

25 exhibited bloody diarrhea, and only 5% died. Weight loss 
was also significantly more pronounced in ITF-def icient 
mice than wild type mice receiving DSS. 

ITF-Def icient M ice Treated with Dextran Sulfate 
Sodium (DSS ) Develop Severe Colonic Erosions 

After seven days of treatment with DSS (2.5% w/v) , 
the colons of wild type and ITF-def icient mice were 
examined histologically. Left colon transections were 
fixed in 4% paraformaldehyde, mounted in paraffin, and 
stained with hematoxylin and eosin. Multiple sites of 



30 
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obvious ulceration and hemorrhage were present in the 
colon of ITF-def icient mice f while the colons of most 
wild type mice were grossly indistinguishable from those 
of untreated mice. Histological examination of the DSS- 
5 treated ITF-def icient colon confirmed the presence of 
multiple erosions and intense inflammatory changes 
including crypt abscesses. Damage was more pronounced in 
the distal colon, i.e., the descending colon, sigmoid 
colon, and rectum, which contained large, broad areas of 

10 mucosal ulceration. When similarly inspected, mucosal 
erosions could be seen in the tissue of 80% of the DSS- 
treated wild type mice, but most were small lesions that 
also appeared to be healing, with complete re- 
epithelialization of most lesions. There was no evidence 

15 of re-epithelialization in the colons of ITF-def icient 
mice exposed to DSS. 

During the normal course of growth and 
development, intestinal epithelial cells originate from 
stem cells in the intestinal crypts and rapidly progress 

20 up the crypt and villus to be extruded from the villus 
tip within five days. After intestinal injury, the 
epithelial covering is repopulated by cells which appear 
to generate signals to heal the lesion by modulation of 
epithelial and mesenchymal cell growth and matrix 

25 formation (Poulsom et al., J. Clin. Gastroenterol. 

17:S78, 1993). In vitro evidence suggests that trefoil 
proteins play a key role in re-establishing mucosal 
integrity after injury. Despite the normal restriction 
of SP and pS2 expression to the proximal gastrointestinal 

30 tract, these trefoil proteins and ITF are abundantly 

expressed at sites of colonic injury and repair. The 
DSS model described above provides a system for testing 
the protective effects of ITF, other trefoil peptides, or 
active polypeptide fragments or variants thereof. One 

35 can administer a molecule to be tested to DSS-treated 
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mice, either wild type or ITF-def icient mice, and 
determine whether the molecule has therapeutic effects by 
performing the assays described above. 

In addition to the use of DSS, any chemical 
5 compound that is known to damage the mucosa lining the 
digestive tract can be used to assay the proteins of the 
invention. These compounds include, but are not limited 
to, alcohol, indomethacin, and methotrexate. For 
example, methotrexate (MTX) can be administered 

10 intraperiotoneally to mice at a dose of 40 mg/kg. One 
group of MTX-treated animals could be given, in addition, 
the protein in question. Various parameters, such as 
body weight, the presence of lesions in the digestive 
tract, and mortality of these animals could then be 

15 compared to equivalent measurements taken from animals 
that were not treated with the protein. 

In Situ H. pylori Binding Assay 

One method for determining whether a given protein 
(or protein fragment or variant) is useful in the 

20 prevention or treatment of diseases associated with H. 
pylori infection is to examine it in the context of an 
established animal model of H. pylori infection. One 
such model was recently developed by Falk et al. (Proc. 
Natl. Acad. Sci. USA 92:1515-1519, 1995). This model 

25 involves the use of transgenic mice that express the 

enzyme a-1 , 3/4-f ucosyltransf erase and, as a consequence, 
express Le b on the surface of mucosal cells that bound 
clinical isolates of H. pylori. If the addition of a 
protein, such as ITF, to this system reduces the level of 

30 H. pylori binding to the mucosal cell, the protein would 
be considered an inhibitor of H. pylori. More 
specifically, the assay could be carried out as follows. 
H. pylori are obtained, for example, from patients with 
gastric ulcers or chronic active gastritis, grown to 
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stationary phase, and labeled, for example with 
digoxigenin or fluorescein isothiocyanate (FITC) . The 
labeled bacteria are then exposed, together with the 
protein of interest, to frozen sections prepared from the 
5 stomach, duodenum, ileum, or liver of adult transgenic 
mice (as described above). As a control, the experiment 
could be performed in parallel using tissue from a wild 
type littermate. The sections are fixed with ice-cold 
methanol for 5 minutes, rinsed three times with wash 

10 buffer (TBS; 0.1 mM CaCl 2 , 1 mM MnCl 2 , 1 mM MgCl 2 ; 10 
minutes/cycle) , and treated with blocking buffer 
(Boehringer Mannheim; see also Falk supra) . Bacteria are 
diluted to an OD 600 of 0.05 with dilution buffer (TBS; 
0.1 mM CaCl 2 , 1 mM MnCl 2 , 1 mM MgCl 2 containing leupeptin 

is (1 fjg/ml) , aprotinin (1 /ig/ml) , [ -l-p-tosylamido-2- 
phenylethyl chloromethyl ketone (100 /xg/ml) , 
phenylmethylsulf onyl fluoride (100 /xg/ml) , and pepstatin 
A (1 /ig/ml)] and overlaid on the sections for 2 hours at 
room temperature in a humidified chamber. Slides are 

20 then washed six times in wash buffer on a rotating v 
platform (5 minutes/cycle at room temperature) . 
Digoxigenin-labeled bacteria are visualized on washed 
slides with FITC-con jugated sheep anti-digoxigenin 
immunoglobulin (Boehringer Mannheim) diluted 1:100 in 

25 histoblocking buffer. Nuclei were stained with 
bisbenzimide (Sigma) . For blocking controls, 
digoxigenin-conjugated stationary-phase bacteria can be 
suspended in dilution buffer to an OD 600 of 0.05 and 
shaken with or without Le b -HSA or Le a -HSA (final 

30 concentration, 50 ^g/ml; reaction mixture, 200 /il) for 
1 hour at room temperature. The suspension is then 
overlaid on methanol-f ixed frozen sections. 
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Use 

In the practice of the present invention, ITF may 
be administered as described below for treatment of 
peptic ulcer diseases, inflammatory bowel diseases, for 
5 protection of the intestinal tract from injury caused by 
bacterial infection, radiation injury, or other insults. 
Tissues that are not a part of the alimentary canal can 
also be treated. These tissues include the skin, the 
corneal surface of the eye, and tissues within the 

10 respiratory and genitourinary tract. The mode of 
administration, dosage, and formulation of ITF will 
depend upon the condition being treated. Further 
guidance regarding treatment regimes is given below. 
Furthermore, treatment may begin before an injury has 

15 occurred because the polypeptides and compositions of the 
invention are believed to exert a protective effect. 

Other Embodiments 
Production of Antibodies 

ITF may be used to produce monoclonal antibodies 
20 for the detection of ITF in intestinal tissue or blood 
serum by means of an indirect immunoassay. ITF may be 
detectably labelled and used in an in situ hybridization 
assay for the detection of ITF binding sites. Labels may 
include, but are not limited to, fluorescein or a 
25 radioactive ligand. 

ITF may be used to protect and stabilize other 
proteins. This protection is accomplished by forming a 
hybrid molecule in which all or part of ITF is fused to 
either the carboxy-terminus or the amino-terminus (or 
30 both) of the protein of interest. Because ITF is 

resistant to degradation in the digestive system, it will 
protect the protein of interest from such degradation. 
As a consequence, the protein of interest is likely to 



> 
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remain active in the digestive system and/or will be more 
readily absorbed in an intact form. 

Stably dimerized trefoil protein can be used in 
the methods of the invention. Such molecules can be 
5 prepared by stably crosslinking monomers of trefoil or by 
expressing a gene encoding a tandem repeat of a trefoil 
protein (e.g., ITF) or a portion thereof (e.g., a portion 
capable of forming the three loop structure 
characteristic of trefoil proteins) . Also useful in the 
10 method of the invention are trefoil proteins produced by 
chemical synthesis. 

The invention also encompasses antibodies that 
bind the polypeptides of the invention, i.e., trefoil 
polypeptides such as ITF. Antibodies that specifically 
15 recognize one or more epitopes of these polypeptides, or 
fragments thereof, are also encompassed by the invention. 
Such antibodies include but are not limited to polyclonal 
antibodies, monoclonal antibodies (mAbs) , humanized or 
chimeric antibodies, single chain antibodies, Fab 
20 fragments, F(ab') 2 fragments, fragments produced by a Fab 
expression library, anti-idiotypic (anti-Id) antibodies, 
and epitope-binding fragments of any of the above. 

The antibodies of the invention may be used, for 
example, in the detection of ITF in a biological sample 
25 and may, therefore, be utilized as part of a diagnostic 
or prognostic technique whereby patients may be tested 
for abnormal amounts of ITF. Typically, the expression 
of ITF is downregulated in the immediate vicinity of a 
lesion, such as that caused by inflammatory bowel 
30 diseases (for example, colitis). 

For the production of antibodies, various host 
animals may be immunized by injection with a peptide 
having a sequence that is present, for example, in ITF. 
Such host animals may include but are not limited to 
35 rabbits, mice, and rats, to name but a few. Various 
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adjuvants may be used to increase the immunological 
response, depending on the host species, including but 
not limited to Freund's (complete and incomplete), 
mineral gels such as aluminum hydroxide, surface active 
5 substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanin, dinitrophenol, and potentially useful human 
adjuvants such as BCG (bacille Calmette-Guerin) and 
Corynebacterium parvum. Polyclonal antibodies are 

10 heterogeneous populations of antibody molecules derived 
from the sera of the immunized animals. 

Monoclonal antibodies, which are homogeneous 
populations of antibodies to a particular antigen, may be 
obtained by any technigue which provides for the 

15 production of antibody molecules by continuous cell lines 
in culture. These include, but are not limited to, the 
hybridoma technigue of Kohler and Milstein (Wature 
256:495-497, 1975; and U.S. Patent No. 4,376,110), the 
human B cell hybridoma technigue (Kosbor et al. , 

20 immunology Today 4_:72, 1983; Cole et al., Proc. Natl. 

Acad. Sci. USA 80:2026-2030, 1983), and the EBV-hybridoma 
technigue (Cole et al. , "Monoclonal Antibodies And Cancer 
Therapy," Alan R. Liss, Inc., pp. 77-96, 1985). Such 
antibodies may be of any immunoglobulin class including 

25 IgG, IgM, IgE, IgA, IgD and any subclass thereof. The 
hybridoma producing the mAb of this invention may be 
cultivated in vitro or in vivo. Production of high 
titers of mAbs in vivo makes this the presently preferred 
method of production. 

30 In addition, technigues developed for the 

production of "chimeric antibodies" (Morrison et al., 
Proc. Natl. Acad. Sci. USA, £1: 6851-6855, 1984; Neuberger 
et al., Nature, 3.12:604-608, 1984; Takeda et al., Nature, 
114:452-454, 1985) by splicing the genes from a mouse 

35 antibody molecule of appropriate antigen specificity 



WO 97/38712 PCI7US9 7/06004 



- 35 - 

together with genes from a human antibody molecule of 
appropriate biological activity can be used. A chimeric 
antibody is a molecule in which different portions are 
derived from different animal species, such as those 
5 having a variable region derived from a murine mAb and a 
human immunoglobulin constant region. 

Alternatively, techniques described for the 
production of single chain antibodies (U.S. Patent 
4 , 946 , 778; Bird, Science 242 :423-426, 1988; Huston et 
10 al. f Proc. Natl. Acad. Sci . USA 85:5879-5883, 1988; and 
Ward et al. , 1989, Nature 334:544-546, 1989) can be 
adapted to produce single chain antibodies against 
trefoil polypeptides such as ITF. Single chain 
antibodies are formed by linking the heavy and light 
15 chain fragments of the Fv region via an amino acid 
bridge, resulting in a single chain polypeptide. 

Antibody fragments which recognize specific 
epitopes may be generated by known techniques. For 
example, such fragments include but are not limited to: 
20 the F(ab') 2 fragments which can be produced by pepsin 
digestion of the antibody molecule and the Fab fragments 
which can be generated by reducing the disulfide bridges 
of the F(ab') 2 fragments. Alternatively, Fab expression 
libraries may be constructed (Huse et al., Science, 
25 246 : 1275-1281, 1989) to allow rapid and easy 

identification of monoclonal Fab fragments with the 
desired specificity. 

These antibodies can, in turn, be utilized to 
generate anti-idiotype antibodies that "mimic" ITF, using 
30 techniques well known to those skilled in the art. (See, 
for example, Greenspan and Bona, FASEB J. 7:437-444, 
1993; and Nissinoff, J. Immunol. 147:2429-2438, 1991). 
Such neutralizing anti-idiotypes or Fab fragments of such 
anti-idiotypes can be used in diagnostic regimens to 
35 detect disorders associated with apoptotic cell death. 
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Antibodies can be humanized by methods known in 
the art. For example, monoclonal antibodies with a 
desired binding specificity can be commercially humanized 
(Scotgene, Scotland; Oxford Molecular, Palo Alto, CA) . 
5 Fully human antibodies, such as those expressed in 
transgenic animals are also features of the invention 
(Green et al., Nature Genetics 7:13-21, 1994; see also 
U.S. Patents 5,54 5,806 and 5,569,82 5, both of which are 
hereby incorporated by reference) . 



10 Administration of ITF to Protect or Treat Tissues 

Outside the Alimentary Canal 

The polypeptides of the invention, including ITF, 
analogs, and fragments thereof, as well as other trefoil 
factors, such as SP (spasmolytic polypeptide) and PS2, 

15 can be used to protect or treat tissues that are not 

found within the alimentary canal. The polypeptides can 
be used, for example, to treat any sort of wound, such as 
a lesion, an ulcer, a burn, or an abrasion on the skin, 
the surface of the eye (i.e., the cornea), or within the 

20 respiratory or genitourinary tracts. The exact nature of 
the injury and the cause of the injury need not be 
precisely defined. 

Regardless of the location of the injury (i.e., 
regardless of whether or not the injury is within the 

25 alimentary canal) , toxicity and therapeutic efficacy of a 
given compound can be determined by standard 
pharmaceutical procedures, using either cells in culture 
or experimental animals to determine the LD 50 (the dose 
lethal to 50% of the population) and the ED 50 (the dose 

30 therapeutically effective in 50% of the population) . The 
dose ratio between toxic and therapeutic effects is the 
therapeutic index and it can be expressed as the ratio 
LD 50 /ED 5O . Compounds which exhibit large therapeutic 
indices are preferred. While compounds that exhibit 
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toxic side effects may be used, care should be taken to 
design a delivery system that targets such compounds to 
the site of affected tissue in order to minimize 
potential damage to unaffected cells and, thereby, reduce 
5 side effects. 

The data obtained from the cell culture assays and 
animal studies can be used in formulating a range of 
dosage for use in humans. The dosage of such compounds 
lies preferably within a range of circulating 

10 concentrations that include the ED 50 with little or no 
toxicity. The dosage may vary within this range 
depending upon the dosage form employed and the route of 
administration utilized. For any compound used in the 
method of the invention, the therapeutically effective 

15 dose can be estimated initially from cell culture assays. 
A dose may be formulated in animal models to achieve a 
circulating plasma concentration range that includes the 
IC 50 (that is, the concentration of the test compound 
which achieves a half-maximal inhibition of symptoms) as 

20 determined in cell culture. Such information can be used 
to more accurately determine useful doses in humans. 
Levels in plasma may be measured, for example, by high 
performance liquid chromatography. 

Pharmaceutical compositions for use in accordance 

25 with the present invention may be formulated in 

conventional manner using one or more physiologically 
acceptable carriers or excipients. The pharmaceutical 
compositions can also contain mucin glycoproteins. 

Thus, the compounds and their physiologically 

30 acceptable salts and solvates may be formulated for 
administration by inhalation or insufflation (either 
through the mouth or the nose) or oral, buccal, 
parenteral, or rectal administration. 

As trefoil polypeptides are not degraded within 

35 the digestive tract, it is expected that the route of 
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administration will be oral. The polypeptide could be 
administered, for example, in the form of a tablet, 
capsule, or pill, or could be suspended in a solution, 
such as a syrup, that the patient swallows. 
5 Alternatively, the solution containing the polypeptide 
may be administered as a gastric lavage. The polypeptide 
may also be included in a solution that is administered 
as an enema, or it may be administered as a suppository. 
For oral administration, which can be used to 

10 treat injured tissue within the alimentary canal, the 
pharmaceutical compositions may take the form of, for 
example, tablets or capsules prepared by conventional 
means with pharmaceutical^ acceptable excipients such as 
binding agents (for example, pregelatinised maize starch, 

15 polyvinylpyrrolidone or hydroxypropyl methylcellulose) ; 
fillers (for example, lactose, microcrystalline cellulose 
or calcium hydrogen phosphate); lubricants (for example, 
magnesium stearate, talc or silica); disintegrants (for 
example, potato starch or sodium starch glycolate) ; or 

20 wetting agents (for example, sodium lauryl sulphate). 
The tablets may be coated by methods well known in the 
art. Liquid preparations for oral administration may 
take the form of, for example, solutions, syrups or 
suspensions, or they may be presented as a dry product 

25 for constitution with water or other suitable vehicle 
before use. Such liquid preparations may be prepared by 
conventional means with pharmaceutical^ acceptable 
additives such as suspending agents (for example, 
sorbitol syrup, cellulose derivatives or hydrogenated 

30 edible fats) ; emulsifying agents (for example, lecithin 
or acacia) ; non-aqueous vehicles (for example, almond 
oil, oily esters, ethyl alcohol or fractionated vegetable 
oils); and preservatives (for example, methyl or propyl- 
p-hydroxybenzoates or sorbic acid) . The preparations may 

35 also contain buffer salts, flavoring, coloring and 
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sweetening agents as appropriate. Preparations for oral 
administration may be suitably formulated to give 
controlled release of the active compound. 

For buccal administration, which can be used to 

5 treat injured tissue within the mouth, throat, or upper 
esophagus, the compositions may take the form of tablets 
or lozenges formulated in a conventional manner. 

For administration by inhalation, which can be 
used to treated injured tissue within the respiratory 

10 tract, the compounds for use according to the present 
invention are conveniently delivered in the form of an 
aerosol spray presentation from pressurized packs or a 
nebulizer, with the use of a suitable propellant, for 
example, dichlorodif luor omethane , trichlorof luoromethane, 

15 dichlorotetraf luoroethane, carbon dioxide or other 

suitable gas. In the case of a pressurized aerosol the 
dosage unit may be determined by providing a valve to 
deliver a metered amount. Capsules and cartridges of, 
for example, gelatin for use in an inhaler or insufflator 

20 may be formulated containing a powder mix of the compound 
and a suitable powder base such as lactose or starch. 

Compositions containing the polypeptides of the 
invention can also be formulated for parenteral 
administration by injection, for example, by bolus 

25 injection or continuous infusion. Formulations for 
injection may be presented in unit dosage form, for 
example, in ampoules or in multi-dose containers, with an 
added preservative. The compositions may take such forms 
as suspensions, solutions or emulsions in oily or aqueous 

30 vehicles, and may contain formulatory agents such as 
suspending, stabilizing and/or dispersing agents. 
Alternatively, the active ingredient may be in powder 
form for constitution with a suitable vehicle, for 
example, sterile pyrogen-free water, before use. 
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The compositions can also be formulated in rectal 
compositions such as suppositories or retention enemas, 
for example, containing conventional suppository bases 
such as cocoa butter or other glycerides. 
5 In addition to the formulations described 

previously, the compounds may also be formulated as a 
depot preparation. Such long acting formulations may be 
administered by implantation (for example subcutaneously 
or intramuscularly) or by intramuscular injection. Thus, 

10 for example, the compounds may be formulated with 

suitable polymeric or hydrophobic materials (for example 
as an emulsion in an acceptable oil) or ion exchange 
resins, or as sparingly soluble derivatives, for example, 
as a sparingly soluble salt. 

15 The compositions may, if desired, be presented in 

a pack or dispenser device which may contain one or more 
unit dosage forms containing the active ingredient. The 
pack may for example comprise metal or plastic foil, such 
as a blister pack. The pack or dispenser device may be 

20 accompanied by instructions for administration. 

The therapeutic compositions of the invention can 
also contain a carrier or excipient, many of which are 
known to skilled artisans. Excipients which can be used 
include buffers (for example, citrate buffer, phosphate 

25 buffer, acetate buffer, and bicarbonate buffer), amino 
acids, urea, alcohols, ascorbic acid, phospholipids, 
proteins (for example, serum albumin), EDTA, sodium 
chloride, liposomes, mannitol, sorbitol, and glycerol. 
The nucleic acids, polypeptides, antibodies, or 

30 modulatory compounds of the invention can be administered 
by any standard route of administration. For example, 
administration can be parenteral, intravenous, 
subcutaneous , intramuscular , intracranial , intraorbital , 
opthalmic , intraventricular , intracapsular , intraspinal , 

35 intracisternal, intraperitoneal, transmucosal, or oral. 
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The modulatory compound can be formulated in various 
ways, according to the corresponding route of 
administration. For example, liquid solutions can be 
made for ingestion or injection; gels or powders can be 
5 made for ingestion, inhalation, or topical application. 
Methods for making such formulations are well known and 
can be found in, for example, "Remington's Pharmaceutical 
Sciences. It is expected that the preferred route of 
administration will be oral. 

io It is well known in the medical arts that dosages 

for any one patient depend on many factors, including the 
general health, sex, weight, body surface area, and age 
of the patient, as well as the particular compound to be 
administered, the time and route of administration, and 

is other drugs being administered concurrently. 

Dosages for the polypeptides and antibodies of the 
invention will vary. For oral administration, the 
polypeptide can be administered in dosages of about 10 mg 
to about 500 mg. For example 10, 50, 100, 200, 250, 300, 

20 4 00, or 500 mg can be administered. These dosages can be 
administered on a periodic basis. For example, a dose 
may be taken one to four times per day. For topical 
administration, the polypeptide can be administered in 
dosages of about 1 to about 10 mg/ml within an ointment, 

25 or cream. This composition can also be administered 
periodically, if necessary. For other routes of 
administration, dosage will also vary, for example, from 
about 0.1 to 1,000 mg per application. Determination of 
the correct dosage within a given therapeutic regime is 

30 well within the abilities of one of ordinary skill in the 
art of pharmacology. 

In order to determine the efficacy of a 
polypeptide in treating a particular disorder, those of 
skill in the art can perform routine studies using any 

35 one of several well known models of injury. For example, 
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the efficacy of a polypeptide in treating injuries to the 
cornea can be carried out using the in vitro model for 
corneal wound healing described by Collin et al. (Current 
Eye Res. 14:331-339, 1995). In this model system, 
5 corneas that were unsuitable for transplantation were 
obtained from human organ donors and used within 5 days 
post-mortem. A cautery tip was used to create a linear 
non-perforating thermal burn approximately 5 mm in length 
on the cornea. The wounded corneas were immediately 

10 dissected and placed in an air/ liquid organ culture 
system (as described in Richard et al., Curr. Eye Res. 
10:739-749, 1991; and Anderson et al., Ophthalmol. Vis. 
Sci. 3:442-449, 1993). Thus, to determine the efficacy 
of a polypeptide of the invention in treating such an 

15 injury, one would simply apply the polypeptide to the 
wounded cornea, for example, by placing the polypeptide 
in the tissue culture medium, and assessing the effect of 
the polypeptide on wound healing in injured, versus 
uninjured, corneas. If additional guidance is required 

20 in assessing the wound, skilled artisans may again 
consult Collin et al. (supra), who also describes 
histochemical analysis of the wounded corneas. Protocols 
for a wound closure model using cultured rabbit corneal 
endothelial cells can also be used (for example, see 

25 Joyce et al., Invest. Ophthalmol. Vis. Sci. 31:1816-1826, 
1990) . Alternatively, to assess the efficacy of a 
polypeptide in the context of a physical wound to the 
cornea, the injury induced as described by Kessler (Curr. 
Eye Res. 14:985-992, 1995) can be used. Similarly, 

30 numerous models are available in which to test the 
efficacy of a polypeptide in preventing or healing a 
wound to the epidermis. Persons of ordinary skill in the 
art are well aware of these models and able to carry out 
the procedures described in the art without resort to 

35 undue experimentation. 



WO 97/38712 



PCT/US97/06004 



- 43 - 

Other embodiments are within the following claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 
(i) APPLICANT: The General Hospital Corporation 

(ii) TITLE OF THE INVENTION: INTESTINAL TREFOIL PROTEINS 

(iii) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson, P.C. 

(B) STREET: 225 Franklin Street 
<C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: US 

(F) ZIP : 02110-2804 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows95 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US97/ 

(B) FILING DATE: ll-APR-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/631,469 

(B) FILING DATE: 12-APR-1996 

(A) APPLICATION NUMBER: 08/191,352 

(B) FILING DATE: 02-FEB-1994 

(A) APPLICATION NUMBER: 08/037,741 

(B) FILING DATE: 25-MAR-1993 

(A) APPLICATION NUMBER: 07/837,192 

(B) FILING DATE: 13-FEB-1992 

(A) APPLICATION NUMBER: 07/655,965 

(B) FILING DATE: 14-FEB-1991 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Meiklejohn, Ph.D., Anita L. 

(B) REGISTRATION NUMBER: 35,283 

(C) REFERENCE / DOCKET NUMBER: 00786/322WO1 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 617-542-5070 

{ B ) TELEFAX: 617-542-8906 
-(C) TELEX: 200154 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: genomic DNA 

(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 18... 260 
5 (D) OTHER INFORMATION: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GAAGTTTGCG TGCTGCC ATG GAG ACC AGA GCC TTC TGG ATA ACC CTG CTG 50 
Met Glu Thr Arg Ala Phe Trp He Thr Leu Leu 
1 5 10 

10 CTG GTC CTG GTT GCT GGG TCC TCC TGC AAA GCC CAG GAA TTT GTT GGC 98 
Leu Val Leu Val Ala Gly Ser Ser Cys Lys Ala Gin Glu Phe Val Gly 
15 20 25 

CTA TCT CCA AGC CAA TGT ATG GCG CCA ACA AAT GTC AGG GTG GAC TGT 146 
Leu Ser Pro Ser Gin Cys Met Ala Pro Thr Asn Val Arg Val Asp Cys 
15 30 35 40 

AAC TAC CCC ACT GTC ACA TCA GAG CAG TGT AAC AAC CGT GGT TGC TGT 194 
Asn Tyr Pro Thr Val Thr Ser Glu Gin Cys Asn Asn Arg Gly Cys Cys 
45 50 55 

TTT GAC TCC AGC ATC CCA AAT GTG CCC TGG TGC TTC AAA CCT CTG CAA 242 
20 Phe Asp Ser Ser He Pro Asn Val Pro Trp Cys Phe Lys Pro Leu Gin 
60 " 65 70 75 

GAG ACA GAA TGT ACA TTT TGAAGCTGTC CAGGCTCCAG GAAGGGAGCT CCACACCC 298 
Glu Thr Glu Cys Thr Phe 
80 

25 TGGACTCTTG CTGATGGTAG TGGCCCAGGG TAACACTCAC CCCTGATCTG CTCCCTCGCG 358 

CCGGCCAATA TAGGAGCTGG GAGTCCAGAA GAATAAAGAC CTTACAGTCA GCACAAGGCT 418 

GTTCTAATTG CGG 431 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Glu Thr Arg Ala Phe Trp He Thr Leu Leu Leu Val Leu Val Ala 
1,-5 10 15 

Gly Ser Ser Cys Lys Ala Gin Glu Phe Val Gly Leu Ser Pro Ser Gin 
20 25 30 

40 Cys Met Ala Pro Thr Asn Val Arg Val Asp Cys Asn Tyr Pro Thr Val 
35 40 45 

Thr Ser Glu Gin Cys Asn Asn Arg Gly Cys Cys Phe Asp Ser Ser He 
50 55 60 
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Pro Asn Val Pro Trp Cys Phe Lys Pro Leu Gin Glu Thr Glu Cys Thr 
65 70 75 80 

Phe 

(2) INFORMATION FOR SEQ ID NO: 3: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 baBe pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GATGCTGGGG CTGGTCCTGG CCTTGCTGTC CTCCAGCTCT GCTGAGGAGT ACGTGGGCCT 60 
GTCTGCAAAC CAGTGTGCCG TGCCGGCCAA GGACAGGGTG GACTGCGGCT ACCCCCATGT 120 
CACCCCCAAG GAGTG CAACA ACCGGGGCTG CTGCTTTGAC TCCAGGATCC CTGGAGTGCC 180 

15 TTGGTGTTTC AAGCCCCTGA CTAGGAAGAC AGAATGCACC TTCTGAGGCA CCTCCAGCTG 240 
CCCCTGGGAT GCAGGCTGAG CACCCTTGCC CGGCTGTGAT TGCTGCCAGG CACTGTTCAT 300 
CTCAGTTTTT CTGTCCCTTT GCTCCCGGCA AGCTTTCTGC TGAAAGTTCA TATCTGGAGC 360 
CTGATGTCTT AACGAATAAA GGTCCCATGC TCCACCCGAA AAA 403 
(2) INFORMATION FOR SEQ ID NO: 4: 

20 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 <ii) MOLECULE TYPE: genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GGGCGGCCGC 10 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

. - (ii) MOLECULE TYPE: genomic DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GTACATTCTG TCTCTTGCAG A 21 

(2) INFORMATION FOR SEQ ID NO: 6: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
5 (D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

TAACCCTGCT GCTGCTGGTC CTGG 24 

(2) INFORMATION FOR SEQ ID NO: 7: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

15 (ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GTTTGCGTGC TGCCATGGAG A 21 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 21 base pairs 

( B ) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

CCGCAATTAG AACAGCCTTG T 21 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
35 GCAGTGTAAC AACCGTGGTT GCTGC 25 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
40 <C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

TGACCCTGTG TCATCACCCT GGC 23 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH : 20 base pairs 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CGGCTGCTCT GATGGCCGCC 20 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
20 GCCGGCCACA GTCGATGAAT C 21 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GAGAGGTTGC TGTTTTGATG ACA 23 
30 (2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GCCAAGTCTT GATGTAGCCA GTT 23 

(2) INFORMATION FOR SEQ ID NO: 15: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Glu Ala Gin Thr Glu Thr Cys Thr Val Ala Pro Arg Glu Arg Gin Asn 
1 5 10 15 

Cys Gly Phe Pro Gly Val Thr Pro Ser Gin Cys Ala Asn Lys Gly Cys 
10 20 25 30 

Cys Phe Asp Asp Thr Val Arg Gly Val Pro Trp Cys Phe Tyr Pro Asn 
35 40 45 

Thr lie Asp Val Pro Pro Glu Glu Glu Cys Glu Phe 
50 55 60 

15 (2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Glu Lys Pro Ala Ala Cys Arg Cys Ser Arg Gin Asp Pro Lys Asn Arg 
15 10 15 

Val Asn Cys Gly Phe Pro Gly lie Thr Ser Asp Gin Cys Phe Thr Ser 
25 * 20 25 30 

Gly Cys Cys Phe Asp Ser Gin Val Pro Gly Val Pro Trp Cys Phe Lys 
35 40 45 

Pro Leu Pro Ala Gin Glu Ser Glu Glu Cys Val Met Glu Val 
50 55 60 

30 (2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Gin Glu Phe Val Gly Leu Ser Pro Ser Gin Cys Met Ala Pro Thr Asn 
15 10 15 

Val Arg Val Asp Cys Asn Tyr Pro Thr Val Thr Ser Glu Gin Cys Asn 
40 20 25 30 
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Asn Arg Gly Cys Cys Phe Asp Ser Ser lie Pro Asn Val Pro Trp Cys 
35 40 45 

Phe Lys Pro Leu Gin Glu Thr Glu Cys Thr Phe 
50 55 

5 (2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Leu Gly Leu Val Leu Ala Leu Leu Ser Ser Ser Ser Ala Glu Glu 
15 10 15 

Tyr Val Gly Leu Ser Ala Asn Gin Cys Ala Val Pro Ala Lys Asp Arg 
15 20 25 30 

Val Asp Cys Gly Tyr Pro His Val Thr Pro Lys Glu Cys Asn Asn Arg 
35 40 45 

Gly Cys Cys Phe Asp Ser Arg lie Pro Gly Val Pro Trp Cys Phe Lys 
50 55 60 

20 Pro Leu Thr Arg Lys Thr Glu Cys Thr Phe 
65 70 
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What is claimed is: 

I. A polypeptide comprising a trefoil polypeptide , or a 
biologically active fragment thereof, for treating or 
inhibiting the formation of lesions, 

5 2. The polypeptide of claim 1, wherein said trefoil 
polypeptide is a human trefoil polypeptide. 

3. The polypeptide of claim 1, wherein said trefoil 
polypeptide is intestinal trefoil polypeptide (ITF) . 

4. The polypeptide of claim 1, wherein said trefoil 
10 polypeptide is spasmolytic polypeptide (SP) . 

5. The polypeptide of claim 1, wherein said trefoil 
polypeptide is PS2 . 

6. The polypeptide of claim 1, wherein said lesion is 
within the alimentary canal. 

is 7. The polypeptide of claim 6, wherein said lesion is 
within the mouth of the patient. 

8. The polypeptide of claim 6, wherein said lesion is in 
the esophagus of the patient. 

9. The polypeptide of claim 6, wherein said lesion is in 
20 the stomach of the patient. 

10w -The polypeptide of claim 6, wherein said lesion is 
in the intestine of the patient. 

II. The polypeptide of claim 6, wherein said patient is 
recieving radiation therapy for the treatment of cancer. 
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12. The polypeptide of claim 6, wherein said patient is 
receiving chemotherapy, for the treatment of cancer, 

13. The polypeptide of claim 6, wherein said patient is 
recieving a drug that damages said alimentary canal. 

5 14. The polypeptide of claim 6, wherein said patient is 
suffering from a digestive disorder. 

15. The polypeptide of claim 14, wherein said digestive 
disorder is non-ulcer dispepsia. 

16. The polypeptide of claim 14, wherein said digestive 
10 disorder is gastritis. 

17. The polypeptide of claim 14, wherein said digestive 
disorder is gastro-esophageal reflux disease. 

18. The polypeptide of claim 14, wherein said digestive 
disorder is a peptic ulcer or duodenal ulcer. 

is 19. The polypeptide of claim 6, wherein said 
administration is oral administration. 

20. The polypeptide of claim 6, wherein said oral 
administration comprises administration of about 
10 milligrams to about 100 milligrams of said polypeptide. 

20 21. The polypeptide of claim 1, wherein said lesion is 
within a tissue other than a tissue in the alimentary canal. 

22. The polypeptide of claim 21, wherein said tissue is 
the skin. 
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23. The polypeptide of claim 22, wherein said 
administration is topical administration. 

24. The polypeptide of claim 23, wherein said topical 
administration comprises administration of an ointment 

5 containing about 1 mg/ml to about 10 mg/ml of the 
polypeptide. 

25. The polypeptide of claim 21, wherein said tissue 
comprises the corneal surface of the eye. 

26. The polypeptide of claim 21, wherein said tissue 
10 comprises a tissue in the respiratory tract. 

27. The polypeptide of claim 21, wherein said tissue 
comprises a tissue in the genitourinary tract. 

28. A composition comprising a trefoil polypeptide for 
treating or inhibiting the formation of lesions in the 

is tissue of a patient. 

29. The use of the composition of claim 28 in the 
manufacture of a medicament for the treatment of lesions in 
the tissue of a patient. 

30. The trefoil polypeptide of claim 1, said polypeptide 
20 further comprising a marker. 

31. The trefoil polypeptide of claim 30, wherein said 
marker comprises an imaging agent. 

32. A method for detecting an ITF receptor in a tissue, 
said method comprising contacting said tissue with 

25 detectably labelled trefoil polypeptide and measuring the 
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level of detectably labelled trefoil polypeptide bound to 
said tissue. 

33. A method for detecting a trefoil polypeptide in a 
tissue, said method comprising contacting said tissue with 
an antibody that specifically binds said trefoil 
polypeptide . 
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(SEQ ID NO: 16) 
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